Deductive Biocomputing

نویسندگان

  • Jeff Shrager
  • Richard Waldinger
  • Mark Stickel
  • J.P. Massar
چکیده

BACKGROUND As biologists increasingly rely upon computational tools, it is imperative that they be able to appropriately apply these tools and clearly understand the methods the tools employ. Such tools must have access to all the relevant data and knowledge and, in some sense, "understand" biology so that they can serve biologists' goals appropriately and "explain" in biological terms how results are computed. METHODOLOGY/PRINCIPAL FINDINGS We describe a deduction-based approach to biocomputation that semiautomatically combines knowledge, software, and data to satisfy goals expressed in a high-level biological language. The approach is implemented in an open source web-based biocomputing platform called BioDeducta, which combines SRI's SNARK theorem prover with the BioBike interactive integrated knowledge base. The biologist/user expresses a high-level conjecture, representing a biocomputational goal query, without indicating how this goal is to be achieved. A subject domain theory, represented in SNARK's logical language, transforms the terms in the conjecture into capabilities of the available resources and the background knowledge necessary to link them together. If the subject domain theory enables SNARK to prove the conjecture--that is, to find paths between the goal and BioBike resources--then the resulting proofs represent solutions to the conjecture/query. Such proofs provide provenance for each result, indicating in detail how they were computed. We demonstrate BioDeducta by showing how it can approximately replicate a previously published analysis of genes involved in the adaptation of cyanobacteria to different light niches. CONCLUSIONS/SIGNIFICANCE Through the use of automated deduction guided by a biological subject domain theory, this work is a step towards enabling biologists to conveniently and efficiently marshal integrated knowledge, data, and computational tools toward resolving complex biological queries.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identifying amino acid residues in medium resolution critical point graphs using instance based query generation.

Instance Based Query Generation is defined and applied to the problem of recognising amino acid residues in medium resolution critical point graphs. The technique is an amalgamation of Relational Instance Based Learning and Frequent Query Discovery in First Order Logic. Instances are automatically constructed from a deductive database and first order association rules are derived from the insta...

متن کامل

Machine learning and deep analytics for biocomputing: call for better explainability.

The goals of this workshop are to discuss challenges in explainability of current Machine Leaning and Deep Analytics (MLDA) used in biocomputing and to start the discussion on ways to improve it. We define explainability in MLDA as easy to use information explaining why and how the MLDA approach made its decisions. We believe that much greater effort is needed to address the issue of MLDA expla...

متن کامل

Computational Approaches to Understanding the Evolution of Molecular Function.

The following sections are included:IntroductionOverview of ContributionsReferences.

متن کامل

Using multiple alignments and phylogenetic trees to detect RNA secondary structure.

We describe a statistical method to determine if a pair of columns in a multiple alignment of a homologous family of RNA sequences shows evidence of being base paired. The method makes explicit use of a given phylogenetic tree for the sequences in the alignment. It is tested on a multiple alignment of 16S rRNA sequences with good results.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS ONE

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2007